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ABSTRACT 

We have analyzed a set of deep ROSAT observations with a total sky coverage 
of 40 square degrees to search for clustering of faint X-ray sources. Using the 
resulting catalog of discrete X-ray sources, we detect, for the first time in X-rays, 
a positive correlation on angular scales of 0^5-10'. When corrected for a bias due 
to limited spatial resolution which amplifies the correlation, the observed angular 
correlation function agrees well with that expected from the spatial correlation 
of optically selected quasars, provided that they comprise an appreciable fraction 
(> 50%) of detected X-ray sources. 
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1. INTRODUCTION 

Models of the X-ray background composition can be strongly constrained by the study of 
its arcminute-scale fluctuations. For example, Soltan & Hasinger (1994, hereafter SH94) have 
shown that a highly correlated population of point sources cannot produce more than 35% 
of the still unresolved fraction of the X-ray background. Until now the angular correlation 
of X-ray sources has not directly been measured, and the usual approach in X-ray studies is 
to assume a spatial correlation model for X-ray sources inferred from optical observations. 
In this Letter we report the first detection of a positive angular correlation of X-ray sources 
detected in a set of medium-deep exposure ROSAT fields covering a total area of 40 square 
degrees. The correlation is detected on both small 1') and moderate (I'-IO') angular 
scales. 

This work continues our series of papers (Vikhlinin et al. 1995a,b,c, hereafter Papers I, II, 
III) in which we developed an efficient method for detection of point sources (Matched Filter, 
Paper I), and applied it to a set of 130 long exposure high Galactic latitude ROSAT/PSPC 
images. Papers II and III discuss the number-flux relation and average spectra of faint X-ray 
sources. In the present study, we explore the spatial distribution of X-ray sources by means of 
the two-point angular correlation function. We show that the detected angular correlation is 
consistent with that expected if X-ray sources are distributed in space like optically selected 
quasars. We compare our results to the angular correlation of the fluctuations in the X-ray 
background (e.g., Georgantopoulos et al. 1993 and SH94). 

2. DATA AND ANALYSIS 

Our sample consists of 254 medium and deep exposure ROSAT observations all hav- 
ing high Galactic latitudes (|6| > 30°) and low galactic absorption (A^h < 6 x 10^° cm~^). 
When possible, we stacked several ROSAT pointings in the same direction, thus increasing 
the exposure for several flelds. The data were screened for high particle and solar scattered 
background periods (Snowden et al. 1994). Special precautions were taken to exchidc inter- 
vals of bad satellite aspect. The exposure times range from ~ 5, 000 s to ~ 100, 000 s, with 
minimum detectable fluxes from ~ 5 x 10^^"^ ergs s^^ cm^^ to ~ 1.2 x 10^^^ ergs s^^ cm~^ 
for the shortest and longest exposure flelds, respectively. The effective sky coverage is 40 
square degrees at bright fluxes, begins to decrease below 5 x 10"^^ ergs s~^ cm~^, and remains 
signiflcant down to at least ~ 4 x 10~^^ ergs s"^ cm~^, where the coverage is 1 square degree. 

Compared to our previous study of the X-ray number-flux relation and average source 
spectra (Papers II, III), we have doubled the geometrical sky coverage, predominantly by 
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inclusion of a large number of lower exposure fields. On the other hand, any pointing 
which had a normal galaxy as its target, which could potentially contaminate the sample by 
inclusion of highly correlated and/or extended sources related to those galaxies, has been 
excluded. For the purposes of the present analysis, we used only the central parts (< 12' off- 
axis) of ROSAT images, excluding also a 3' radius circle around the targets of observations. 
Data analysis was performed in exactly the same way as described in Papers 1 and 11. 

The conventional way to determine the two-point angular correlation function (ACF) 
is to calculate the ratio of the number of pairs of sources in the real data (DD), which fall 
in the separation range of {6,9 + dO), and the number of those pairs expected when the 
spatial distribution of sources is random (RR): w{6) = ^ — 1. Since the source detection 
efficiency decreases far off the telescope axis (primarily because of the PSF degradation), 
the appropriate way to determine RR is to simulate observations with the background and 
the source number-flux relation found in actual observations, apply the detection algorithm 
to the simulated images, and compute RR as the number of detected source pairs found in 
the simulated data. 

Simulations were performed according to the following procedure. The source positions 
were chosen uniformly at random. Source fluxes were then determined from the log N - log S 
distribution measured in Paper II (A^( > S) cc S~^'^^ for S > 2.2 x 10~^^ ergs s~^ cm~^ 
and N{ > S) cc S~^'^^ below this flux). The differential log N - log S was truncated at 
a limiting flux, /min = 3 x 10^^^ ergs s^^ cm~^, which is almost three orders of magnitude 
below the detection threshold of the deepest observation, and corresponds to a source surface 
density of ~ 10 per PSF circle. At the limiting flux value, the number of photons per source 
is much less than 1, so that the contribution from sources below fmm is indistinguishable 
from truly diffuse emission. Source fluxes were converted to counts (after accounting for the 
telescope vignetting function) using the counts-to-flux conversion coefficient calculated for an 
absorbed a — 1 power-law spectrum, with the absorption flxed at the appropriate Galactic 
value for each ROSAT observation. The number of source photons was drawn from a Poisson 
distribution. Photons were distributed over the images according to the PSF approximation 
of Hasinger et al. (1993). 

To achieve the requirement of having the same background in the real and simulated 
data, an additional diffuse component must be added to the simulated images. This simulates 
various PSPC background components - particle, scattered solar, truly diffuse CXB, etc. The 
required diffuse component was determined through an iterative procedure as follows: (0) 
generate a "diffuse map" which is initially set to zero; (1) simulate sources as described in the 
previous paragraph; (2) add the Poisson-scattered "diffuse map"; (3) detect point sources 
and calculate the background map for the simulated image; (4) add the difference between 
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the background maps of the simulated and real images to the "diffuse map" ; repeat steps 
(l)-(4) until the difference between the background maps of the simulated and real images 
is negligible. 

Each ROSAT observation was simulated 5 times. The simulated data were formatted 
identically to the actual observations and processed in exactly the same manner as the real 
data. Several tests were used to check the quality of the simulated data. First, the log N - 
log S distribution determined from simulations was found to be in good agreement with the 
input one. Next, we checked that wc do detect the same number of sources in the simulated 
and real data (both for the whole data set and for various subsets like short/long exposure, 
on- axis/off-axis etc.). In each case we found good agreement between the real data and 
simulations; for example, the total number of sources in the real data is 2158, while the 
number of sources averaged over the simulations is 2199 (which is less than la difference). 

The ACF was calculated as w 

(0) = ^ - I, Error bars for the ACF were determined 
assuming Poisson statistics (Peebles 1980). The resulting ACF is shown in Fig. 1. In 
the angular separation range of 25"-100" there is an ~ 4(7 detection (238 pairs found in 
the real data, 182 pairs are expected from simulations), and an ~ 3.5(7 detection between 
100" and 500" (4513 pairs in the real data and 4276 in simulations). The power law model 
w{9) — {6/9qY~'' yields best fit parameters 7 = 1.7±0.3 and a correlation angle ^0 = 10"±8" 
(one-parameter 68% confidence intervals); if the value of 7 is fixed at 1.8 (the value found for 
normal galaxies and optically selected quasars), the error in 6q is much smaller: 6q — 10"±2". 

What is the origin of the positive correlation which we detect? Potential contributors to 
the small-scale correlation include extended sources like clusters of galaxies, nearby groups 
or even single galaxies of large angular size because they can be detected as multiple point 
sources. However, we have compiled the sample so that it does not contain such objects as 
targets of observations. In addition, we have searched through NED and SIMBAD and found 
that very few catalogued clusters, groups or nearby galaxies are contained in the inner field 
of view for the observations we used. Visual inspection of close pairs has shown that in only a 
few cases are the sources hkely to be related to a single extended source; they are not "special" 
objects such as weak "sources" in the wings of the PSF around very bright sources or ghost 
images. Also, the detection of correlation at large separations (100"-500") does not support 
the hypothesis that the correlation is due to the presence of a number of extended sources. 
Finally, the (relatively well-constrained) power law slope of the ACF is fully consistent with 
the power law slope of 1.8 found for normal galaxies (e.g. Davis & Peebles 1983) and the 
suggested value for optically selected quasars (Shanks & Boyle 1994). Thus it is likely that 
the observed correlation arises from clustering of QSOs, the most numerous class of faint X- 
ray sources (Boyle et al. 1993). Below we argue that the amphtude of the angular correlation 
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expected from the spatial correlation of optically selected quasars is in good agreement with 
our mcasTircments, when the latter are appropriately corrected for the amplification bias 
which arises from the limited PSPC spatial resolution. 

3. CORRECTION FOR THE AMPLIFICATION BIAS 

Unfortunately, the best fit correlation angle = 10" obtained for the directly measured 
ACF of the detected sources is smaller than the FWHM of the ROSAT PSPC PSF, which is 
~ 25" on-axis. This implies that sources separated by less than ~ 20" are detected as a single 
object. Thus the distribution of detected sources is quite different from the distribution of 
real sources on the sky. It is clear that the spatial distribution of detected sources is close to 
the distribution of peaks in the pattern obtained by convolution of the distribution of real 
sources with the PSPC PSF. This is the origin of the amplification bias, which results in 
a more clustered distribution of peaks compared to the distribution of underlying objects. 
This effect was studied by Kaiser (1984) as applied to the correlation function of clusters 
of galaxies. Kaiser has shown that if one smoothes the galaxy distribution with a Gaussian 
window of ~ 10 Mpc size, the correlation function of peaks ( "clusters" ) is biased with respect 
to the correlation function of galaxies: ^clusters (^") = ^Cgaiaxies('")- For reasonable values of 
model parameters, the amplification factor A can be as large as ~ 10. Therefore, we can 
expect that our results may be strongly affected by the amplification bias. 

To determine the influence of the amplification bias, we have repeated the Monte-Carlo 
simulations described in section 2, with correlated input source positions. The sources with 
a power-law angular correlation function were simulated using the 2-dimensional version of 
the algorithm, described in Soneira & Peebles (1978). We fixed the power law slope of the 
input correlation function and adjusted its amplitude so that the ACF of detected sources 
(measured as described in section 2) is equal to that in the real data. For the simulations, 
we know the positions of both input and detected sources, so that we can determine the 
influence of the amphfication bias. The ACFs of input and detected sources are shown in 
Fig 2. The power law shape of the input ACF was unaffected by the PSF and the detection 
algorithm, while the amplitude of the correlation function of detected sources is increased 
by a factor of 2.85. To check that the difference is indeed caused by the amplification bias, 
and not by an error in our detection algorithm, we also measured the ACF of peaks in the 
pattern obtained by the convolution of input source positions with the PSF. This is shown by 
open circles in Fig 2. The agreement of the ACF for detected sources and peaks is excellent. 
The success of this test makes us confident that we are indeed dealing with the amplification 
bias. The correction factor of 2.85 must be applied to the measured ACF in order to obtain 
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the unbiased estimate. With this correction, the correlation angle for the underlying source 
population is 6q = 4". In the following section we compare this corrected ACF with that 
expected for the sources clustered like optically selected quasars. 



4. COMPARISON WITH OPTICAL AND OTHER X-RAY STUDIES 

Most studies of QSO clustering show that quasars are indeed strongly clustered at 
z ~ 1 — 2. Although the reports on clustering evolution are controversial, there is a general 
agreement about the clustering amplitude at these redshifts - at a comoving scale of 10h~^ 
Mpc the amplitude of the two-point correlation function is ~ 1 (see Shanks & Boyle 1994 
for a recent analysis). In the calculations below, we make a common assumption that the 
spatial two-point correlation function at any redshift is given by 

ar,z) = (r/ro)-^ {l + z)-''~' (1) 

with the parameter e describing clustering evolution, for example e ^ 8 — 7 ~ —1.2 corre- 
sponds to the clustering which is constant in comoving coordinates. For a power law spatial 
correlation function, the angular correlation function is also a power law w{6) = {0/9qY~"' 
with amplitude (Peebles 1980) 

.1-7 _ ,-7 rr fElV C y'-' dy <P{yf (1 + z{y))-^-^+yF{y) 
' '\c) [S^y^dyc^{y)/F{y)f ' 

where = r(|) 1(2^)/ r(f ), and F{y) = [1 - (i^a^)^ (^] - 1)]V2. The parameter y is 
related to the coordinate distance and redshift through 

y = ^°"°c = ' iniTz) ' 

and (j){y) determines the fraction of sources observable at the given y (or z), i.e. those with 
observed fluxes greater than the detection threshold Srain- The cosmological model enters 
through the z - y relation (eq. |]), volume factor F{y), and the selection function (p{y). 
The latter also depends on the source luminosity function evolution. Boyle et al. (1993) 
derive the cosmological evolution of QSOs in the form of pure luminosity evolution, with 
the characteristic luminosity scaling with redshift as (1 + z)^. For an object with luminosity 
L{E) at redshift z, the observed energy flux is / = L{E{1 + z)){l + z)/AttD1, where 
is the luminosity distance. For the power law energy spectrum E~°' with a = 1 (which is 
close to what is observed, see Paper III), L{E{1 + z)){l + z) = L{E) so that the minimum 
luminosity is related to the minimum detectable flux simply as Lmin = 47r Dj^ fmin- The 
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selection function can be written as 



/•oo 

m = / ^{L,z{y))dL 



j 

J 4 



oo 



^L, 0) dL, 



(4) 



where -z) is the QSO luminosity function at redshift z, and -R(-z) is the ratio of the 
typical luminosity at a redshift z to that at the present epoch. Given the spatial correlation 
function (eq. |l|), the present day QSO luminosity function $(L, 0) and the evolution rate 
R{z), eqns. 2-4 enable one to calculate the angular correlation amplitude, ^o- Of numerous 
parameters in eqns. 1-4, the angular correlation function most strongly depends on the spatial 
clustering scale vq, the rate of clustering evolution e, and the power law slope 7. There is 
an indication that the value of 7 is almost "universal", i.e. clustering of different classes of 
objects like galaxies, galaxy groups, clusters of galaxies, QSOs, is well described by a power 
law with indices close to 1.8, while the clustering scales tq are very different (see Bahcall 
1988 for a review). As we showed aboved, 7 = 1.8 is also in good agreement with the 
angular correlation of X-ray sources we report in this Letter. We hence fix 7 at the value 
of 1.8 and calculate the angular correlation function for different values of tq and e. We 
then compare the predicted and measured ACF, to constrain the parameters of the spatial 
correlation function. We assume that Q = 1 and the luminosity function and its evolution 
are those given by Boyle et al. (1993). The 90% confidence regions of parameters, calculated 
for two values of the QSO fractionj^ are shown in Fig. 3. For simplicity of comparison of 
our results with the analysis of SH94, the clustering scale in Fig. 3 is expressed in units of 
the correlation length at z = 1.5. The grey-shaded region in this figure shows the range of 
clustering parameter values measured at 2; = 1 — 1.5 for optically selected quasars (Shanks 
& Boyle 1994). For any reasonable fraction of quasars in our X-ray sample, 0.5 < f < 1.0, 
the X-ray and optical data are in good agreement. 

We also compare our results with other studies of the X-ray background fluctuations. 
The angular correlation function of the unresolved background in ROSAT Deep Survey fields 
has been measured by Georgantopoulos et al. (1993) and SH94 who detect a positive cor- 
relation on small angular scales (< 3'). At larger separations rather tight upper limits on 
the angular correlation are obtained, and the general consensus (e.g., Carrera & Barcons 
1992, Danese et al. 1993, SH94) is that a population of sources which are clustered like 



2 If the fraction of clustered sources /o < 1, and other sources are randomly distributed, 
the ACF of the total sample equals the ACF of clustered sources times /q; if there are two 
populations both clustered, but independent from each other, the ACF must be multiplied 
by a factor /q < / < 1. In our calculations we assume that QSOs are clustered, while 
other sources are not. 
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QSOs cannot produce more than 30% — 50% of the still unresolved XRB. For example, Fig. 
8 of SH94 shows that the isotropy of the residual XRB requires that a population of sources 
which contribute 100% of the residual XRB must have a correlation length Xq < 2h~^ Mpc 
at z = 1.5. This is significantly below the correlation length of both QSOs and the X-ray 
sources in our sample. Thus, there is a considerable discrepancy: on the one hand, directly 
detected X-ray sources are highly correlated (their clustering is consistent with ~ 80% of 
them being spatially distributed like QSOs), while the residual XRB (after subtraction of 
detected sources) appears very smooth. This may imply significant changes in the nature of 
sources contributing to the XRB at faint fiux levels (close to the sensitivity limit of ROSAT 
Deep surveys). Narrow emission fine galaxies identified in Deep surveys (e,g., Boyle et al. 
1995) may be promising candidates for a new population. Also, one cannot exclude a signif- 
icant contribution of truly diffuse emission to the unresolved XRB. Hasinger et al. (1993b) 
showed that the contribution of a diffuse component up to 25% of the total XRB flux is not 
excluded. Wang & McCray (1993) found that 40% of the XRB in the 0.5 - 0.9 keV band 
arises from a diffuse thermal component. Both results indicate that a significant fraction 
of the unresolved XRB may be truly diffuse, thus making it very smooth compared to the 
clustering found for detected sources. 

We have made a rough estimate of the contribution of the observed point source cor- 
relation to that reported by SH94 for the unresolved XRB since we detect sources below 
their threshold (see Paper II) . SH94 gave a 2a fimit of 30-35% for the fraction of unresolved 
sources contributing to the XRB that could be clustered like quasars based on analysis of 
images with a typical fiux fimit of about 1 x 10~^^ ergs cm"^. Above this threshold ~ 40% 
of the soft XRB is resolved into point sources. The sources in the fiux range 1.5 — 10 x 10^^^ 
ergs s~^ cm~^ (i.e. between the typical sensitivities of the deepest field and SH94) comprise 
~ 30% of the XRB. If these faint sources are clustered not significantly less than the major- 
ity of our sources (around a fiux of 10"^"^ ergs s~^ cm~^), a large fraction of the maximum 
correlation allowed by SH94's limit for the "residual" XRB must be accounted for by the 
sources we detect. If bright and faint sources exhibit the same clustering, SH94's 2a up- 
per fimit for the correlation in the "unresolved" XRB is produced. In this case the XRB 
below 1.5 X 10~^^ ergs s~^ cm~^, which comprises ~ 35% of the total XRB, should exhibit 
essentially no clustering since we have already observed the quasar-like, clustered population 
allowed by the SH94 fimit. 
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5. CONCLUSIONS 

We report on the first detection of a positive angular correlation derived from the dis- 
tribution of individual, faint X-ray sources. This correlation is detected with high statistical 
significance on both small 1') and moderate (I'-IO') angular scales. The two-point angu- 
lar correlation function is well described by a power law {9/00)^^"', with 7 = 1.7 ± 0.3 which 
is remarkably close to the corresponding value for normal galaxies and optically selected 
quasars. We show that the amplitude of correlation is also consistent with the hypothesis 
that X-ray sources are spatially distributed in the same way as optically selected quasars. 
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Figure Captions 



Fig. 1. — Angular two-point correlation function of X-ray sources. The best fit power 
law model {9/9qY~'^ is shown by the dashed fine. The power law slope is relatively well 
constrained and 7 is remarkably close to that for normal galaxies and optically selected 
quasars (1.8). 

Fig. 2. — Comparison of the simulated and measured ACF. The source positions were 
simulated so that the ACF of detected sources in simulations (open circles) is equal to that of 
sources detected in the real data (fit is shown by the solid line). The ACF of input positions 
is shown by small solid squares; the dashed line is the power law 9~^'^ fit to the input ACF. 
The shapes of the input and measured ACF are the same, while the latter has a factor of 
3 higher normalization. To verify that the difference is due to the amplification bias, the 
input source positions were convolved with the PSPC PSF, and the ACF of peaks in the 
convolved pattern has been measured (large solid circles). 

Fig. 3. — The 90% confidence intervals for the correlation length at 2; = 1.5 (in comov- 
ing coordinates) and the evolution parameter e. Assuming 1) a spatial correlation function of 
the form ^(r, z) = (r/ro)"''' {1 + z)~^~^ , 2) a source luminosity function and evolution that of 
X-ray selected quasars as given by Boyle et al. (1993), and 3) Q = 1, the angular correlation 
function was calculated for two values of the clustered population fraction / = 0.6, 1.0 and 
compared with the measured ACF (Fig. 1) by means of a x^-test. The grey-shaded region 
shows the range of values of Xq measured for optically selected quasars (Shanks & Boyle 
1994). X-ray and optical data are in good agreement if the QSO fraction is 60%-80%, close 
to what is observed. 
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